Conversation


@tlrx tlrx commented Sep 12, 2025

This change fixes a bug that causes a deadlock in the thread pool merge scheduler when a merge fails due to a tragic event.

The deadlock occurs because Lucene aborts running merges when failing with a tragic event and then waits for them to complete. But those "running" merges might in fact be waiting in Elasticsearch's thread pool merge scheduler task queue, they might be waiting in the backlogged merge tasks queue because the per-shard concurrent merge limit has been reached, or they might simply be waiting for enough disk space to run. In those cases, the failing merge thread waits indefinitely.
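
To illustrate the shape of the problem, here is a minimal, self-contained sketch with hypothetical names only (not the actual Lucene or Elasticsearch code): a single-threaded merge pool where the failing thread waits for a merge that is still queued behind it and therefore can never run.

```java
import java.util.concurrent.CountDownLatch;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.TimeUnit;

// Hypothetical illustration, not Lucene or Elasticsearch code: the failing merge
// thread waits for a "running" merge that is in fact still queued behind it and
// can never get a thread, so the wait never returns.
public class MergeWaitDeadlock {
    public static void main(String[] args) throws Exception {
        ExecutorService mergePool = Executors.newFixedThreadPool(1); // a single merge thread
        CountDownLatch queuedMergeDone = new CountDownLatch(1);

        mergePool.submit(() -> {
            // A second merge gets enqueued while the only merge thread is busy failing.
            mergePool.submit(queuedMergeDone::countDown);
            try {
                // The failing thread now waits for that merge "to complete". The real
                // scheduler waits without a timeout; one is used here only to show the hang.
                boolean completed = queuedMergeDone.await(2, TimeUnit.SECONDS);
                System.out.println(completed ? "merge completed" : "deadlock: queued merge never ran");
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
            }
        });
        mergePool.shutdown();
    }
}
```

Run as-is this prints the deadlock message after the timeout; in the real scheduler there is no timeout, so the thread blocks forever.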

The fix proposed in this change uses the merge thread that is failing due to a tragic event to abort all other enqueued and backlogged merge tasks of the same shard before proceeding with closing the IndexWriter. This way Lucene won't have to wait for any running merges, as they will all have been aborted upfront.
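
As a rough illustration of that approach (hypothetical names, not the actual ThreadPoolMergeScheduler code), the failing thread drains and aborts every pending merge task of the affected shard before the writer is closed:

```java
import java.util.ArrayList;
import java.util.Iterator;
import java.util.List;
import java.util.Queue;
import java.util.concurrent.ConcurrentLinkedQueue;

// Hypothetical sketch, not the actual scheduler code: before closing the IndexWriter,
// the failing thread removes every pending merge task of the affected shard from the
// queued and backlogged collections and aborts each one, so the later wait for
// running merges has nothing left to wait on.
public class AbortPendingMergesSketch {
    record MergeTask(String shardId, Runnable work) {}

    private final Queue<MergeTask> queuedMergeTasks = new ConcurrentLinkedQueue<>();
    private final Queue<MergeTask> backloggedMergeTasks = new ConcurrentLinkedQueue<>();

    void onTragicMergeFailure(String shardId) {
        List<MergeTask> toAbort = new ArrayList<>();
        drainTasksOfShard(queuedMergeTasks, shardId, toAbort);
        drainTasksOfShard(backloggedMergeTasks, shardId, toAbort);
        toAbort.forEach(this::abort); // abort upfront, before closing the IndexWriter
        // ... then proceed with closing the IndexWriter (not shown here)
    }

    private static void drainTasksOfShard(Queue<MergeTask> queue, String shardId, List<MergeTask> out) {
        for (Iterator<MergeTask> it = queue.iterator(); it.hasNext(); ) {
            MergeTask task = it.next();
            if (task.shardId().equals(shardId)) {
                it.remove(); // remove so no merge thread can ever pick it up
                out.add(task);
            }
        }
    }

    private void abort(MergeTask task) {
        // Placeholder: in a real scheduler this would mark the merge aborted and
        // notify whoever is waiting on it.
    }
}
```

Because the queued and backlogged tasks are aborted before the writer is closed, Lucene's subsequent wait for running merges returns immediately.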

Relates ES-12664

@tlrx tlrx added >bug :Distributed Indexing/Engine Anything around managing Lucene and the Translog in an open shard. labels Sep 12, 2025
@elasticsearchmachine

Hi @tlrx, I've created a changelog YAML for you.


@henningandersen henningandersen left a comment


Overall looks good. I left some suggestions that I'd like your input on. I think avoiding the pending merge list is a simplification.


tlrx commented Sep 19, 2025

@henningandersen Thanks a lot for the feedback, I borrowed your code with minor changes.

I had a couple hundred successful test executions locally, but I'll continue to run more before merging. This is ready for review.


tlrx commented Sep 19, 2025

@henningandersen I updated the PR with your feedback; let me know if you have any more comments.

I'll run more tests locally and on CI just in case, but so far everything looks green.


@henningandersen henningandersen left a comment


LGTM.


lkts commented Sep 19, 2025

This should be backported, right?

@henningandersen

> This should be backported, right?

Yes, to 8.19, 9.0, 9.1, IIUC.

@tlrx tlrx merged commit e6d78b0 into elastic:main Sep 22, 2025
33 of 34 checks passed

tlrx commented Sep 22, 2025

Thanks @henningandersen for the help!

> This should be backported, right?
>
> Yes, to 8.19, 9.0, 9.1, IIUC.

Will backport today.

@tlrx tlrx deleted the 2025/09/09/ES-12664 branch September 22, 2025 08:13
tlrx added a commit to tlrx/elasticsearch that referenced this pull request Sep 22, 2025
…the IndexWriter (elastic#134656)

tlrx added a commit to tlrx/elasticsearch that referenced this pull request Sep 22, 2025
…the IndexWriter (elastic#134656)

tlrx added a commit that referenced this pull request Sep 22, 2025
…the IndexWriter (#134656) (#135173)

tlrx added a commit that referenced this pull request Sep 22, 2025
…loses the IndexWriter (#134656) (#135175)

Backport of #134656 for 9.0.8
tlrx added a commit that referenced this pull request Sep 22, 2025
…closes the IndexWriter (#134656) (#135177)

Backport of #134656 for 8.19.5
szybia added a commit to szybia/elasticsearch that referenced this pull request Sep 22, 2025
* upstream/main: (50 commits)
  Fix deadlock in ThreadPoolMergeScheduler when a failing merge closes the IndexWriter (elastic#134656)
  ...